An Empirical Study of the Performance of a Simulated Multithreaded Processor
Authors
Martin T. Rowland and Randall A. Helzerman
Intel Corporation, Design Technologies, 2111 N.E. 25th Avenue, JF1-71, Hillsboro, OR 97124-5961
August 23, 1995
Abstract
Because dependencies limit the amount of parallelism in an instruction stream, a VLIW or superscalar processor typically cannot execute as many instructions each cycle as it has functional units. This otherwise wasted processing power can be harnessed in a multithreaded architecture, which exploits the independence of instructions from separate processes. If a multithreaded processor finds that the current process cannot use a functional unit, it looks for an instruction from another ready process that can. This paper presents the results of an empirical study that measures how effectively multithreaded processing improves throughput when applied to a superscalar architecture.
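The slot-filling idea described in the abstract can be illustrated with a toy model. The following is a minimal sketch, not the simulator used in the study: the function name `simulate`, the instruction encoding `(unit_kind, deps)`, the single-cycle latency, the in-order issue within each thread, and the fixed thread priority are all assumptions made for illustration.

```python
from collections import Counter


def simulate(threads, units):
    """Return the number of cycles needed to issue every instruction.

    threads: list of programs; each program is a list of (unit_kind, deps)
             tuples, where deps holds indices of earlier instructions in the
             same program that must have issued in a previous cycle.
    units:   dict mapping unit kind -> number of functional units of that kind.

    Assumptions (hypothetical, for illustration only): every instruction takes
    one cycle, each thread issues in order, and lower-numbered threads get
    first pick of the functional units.
    """
    for prog in threads:
        for kind, _ in prog:
            if units.get(kind, 0) < 1:
                raise ValueError(f"no functional unit of kind {kind!r}")

    next_pc = [0] * len(threads)                    # next instruction per thread
    issued_at = [[None] * len(p) for p in threads]  # issue cycle per instruction
    cycle = 0
    while any(pc < len(p) for pc, p in zip(next_pc, threads)):
        free = Counter(units)                       # units available this cycle
        for tid, prog in enumerate(threads):
            # Issue from this thread until it stalls; leftover units then
            # fall through to the next ready thread.
            while next_pc[tid] < len(prog):
                kind, deps = prog[next_pc[tid]]
                ready = all(
                    issued_at[tid][d] is not None and issued_at[tid][d] < cycle
                    for d in deps
                )
                if not ready or free[kind] == 0:
                    break
                free[kind] -= 1
                issued_at[tid][next_pc[tid]] = cycle
                next_pc[tid] += 1
        cycle += 1
    return cycle


# A chain of four dependent ALU instructions: one can issue per cycle at most.
chain = [("alu", []), ("alu", [0]), ("alu", [1]), ("alu", [2])]
two_alus = {"alu": 2}

print(simulate([chain], two_alus))         # 4: the second ALU sits idle
print(simulate([chain, chain], two_alus))  # 4: a second thread fills the idle ALU
```

In this toy setting, running the two chains one after the other would take 8 cycles, while interleaving them takes 4, because the second thread's instructions are independent of the first thread's dependency chain. The numbers say nothing about the throughput gains measured in the paper.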
Similar resources
Memory Hierarchy Studies of Multimedia-enhanced Simultaneous Multithreaded Processors for MPEG-2 Video Decompression
This paper explores cache models for a simultaneous multithreaded processor with multimedia enhancements. We start with a wide-issue superscalar processor, enhance it by the simultaneous multithreading (SMT) technique, by multimedia units, and by an additional on-chip RAM storage. Our workload is a multithreaded MPEG-2 video decompression algorithm that extensively uses multimedia units. Variou...
Design and Implementation of Field Programmable Gate Array Based Baseband Processor for Passive Radio Frequency Identification Tag (TECHNICAL NOTE)
In this paper, an Ultra High Frequency (UHF) base band processor for a passive tag is presented. It proposes a Radio Frequency Identification (RFID) tag digital base band architecture which is compatible with the EPC C C2/ISO18000-6B protocol. Several design approaches such as clock gating technique, clock strobe design and clock management are used. In order to reduce the area Decimal Matrix C...
An Effective Hybrid Genetic Algorithm for Hybrid Flow Shops with Sequence Dependent Setup Times and Processor Blocking
Hybrid flow-shop, or flexible flow-shop, problems have remained a subject of intensive research for several years. Hybrid flow-shop problems overcome one of the limitations of the classical flow-shop model by allowing parallel processors at each stage of task processing. In many papers the assumptions are generally made that there is unlimited storage available between stages and the setup times a...
Performance Characterization of a Multithreaded Architecture: Where Are the Benefits?
Multithreaded architectures hold the promise of high performance through an overlap of computation and communication. This paper explores how the overlap in multithreaded execution affects the performance of processor, memory, and network subsystems; what are the critical parameters to ensure high processor performance; and what is the performance impact of optimizations of the workload and arch...
A Preliminary Performance Study of Architectural Support for Multithreading
This paper discusses a preliminary performance study of a hybrid multithreaded execution model that combines a software-controlled multithreaded system with hardware support for efficient context switching and thread scheduling. The hardware support for multithreading is augmented with a software thread scheduling technique called set scheduling, and their benefit to the overall performance is d...